Dynamic File-access Characteristics of a Production Parallel Scientiic Workload
نویسندگان
چکیده
Multiprocessors have permitted astounding increases in computational performance, but many cannot meet the intense I/O requirements of some scientiic applications. An important component of any solution to this I/O bottleneck is a parallel le system that can provide high-bandwidth access to tremendous amounts of data in parallel to hundreds or thousands of processors. Most successful systems are based on a solid understanding of the expected workload, but thus far there have been no comprehensive workload characterizations of multiprocessor le systems. This paper presents the results of a three week tracing study in which all le-related activity on a massively parallel computer was recorded. Our instrumentation diiers from previous eeorts in that it collects information about every I/O request and about the mix of jobs running in a production environment. We also present the results of a trace-driven caching simulation and recommendations for designers of multiprocessor le systems.
منابع مشابه
File-Access Characteristics of Parallel Scientific Workloads
Phenomenal improvements in the computational performance of multiprocessors have not been matched by comparable gains in I/O system performance. This imbalance has resulted in I/O becoming a significant bottleneck for many scientific applications. One key to overcoming this bottleneck is improving the performance of parallel file systems. The design of a high-performance parallel file system re...
متن کاملA Metadata Workload Generator for Data-Intensive File Systems
Large-scale data-intensive computing [2, 3] has posed numerous challenges to the underlying distributed file system, due to the unprecedented amount of data, the large number of users, the intense competition on cost and service quality, and the emergence of new applications. As a result, there has been an increasing amount of research on scalable metadata management [4, 6], high availability [...
متن کاملOn the Beneets and Limitations of Dynamic Partitioning in Parallel Computer Systems
In this paper we analyze the beneets and limitations of dynamic partitioning across a wide range of parallel system environments. We formulate a general model of dynamic partitioning that can be t-ted to measurement data to obtain a suuciently accurate quantitative analysis of real parallel systems executing real scientiic and/or commercial workloads. An exact solution of the model is obtained ...
متن کاملFile System Workload Analysis For Large Scale Scientific Computing Applications
Parallel scientific applications require high-performance I/O support from underlying file systems. A comprehensive understanding of the expected workload is therefore essential for the design of high-performance parallel file systems. We re-examine the workload characteristics in parallel computing environments in the light of recent technology advances and new applications. We analyze applica...
متن کاملFile System Workload Analysis For Large Scientific Computing Applications
Parallel scientific applications require high-performance I/O support from underlying file systems. A comprehensive understanding of the expected workload is therefore essential for the design of high-performance parallel file systems. We re-examine the workload characteristics in parallel computing environments in the light of recent technology advances and new applications. We analyze applica...
متن کامل